EFFECTS OF SHIFTS IN THE LEVEL OF
AUTOMATION ON OPERATOR PERFORMANCE¹

Raja Parasuraman, Toufik Bahri, Robert Molloy, and Indramani Singh
Cognitive  Science  Laboratory,  The  Catholic  University  of  America 

INTRODUCTION 

The introduction of intelligent automation in advanced cockpits has raised many human-factors
issues. Although benefits have accrued from automation, potential costs have also been
noted (Norman et al., 1988; Wiener, 1988; see Parasuraman et al., 1990, for a review). For
example, automation may adversely affect a pilot's system awareness and ability to revert to
manual control (Chambers & Nagel, 1985). Over-reliance on automated systems may also lead to
pilot "complacency" (Wiener, 1981).

Adaptive automation² has been proposed as an automation concept that may preserve the
benefits of automation while minimizing its costs (Rouse, 1988). In adaptive systems, automated
aids are invoked dynamically in response to changing mission demands. Such systems are thought
to be superior to conventional automation because they provide for regulation of operator
workload, maintenance of skill levels, and system involvement (Hancock & Chignell, 1988;
Parasuraman, 1987). At present, however, empirical evidence for the efficacy of adaptive
automation is lacking (Parasuraman et al., 1990).

The application of adaptive systems to cockpit automation was discussed in a recent report
by Parasuraman et al. (1990). Among the issues they raised were the effects of different types of
adaptive automation on performance, the adaptive logic used to implement task changes, pilot
consent to suggested adaptive changes, the effects of automation failure, and so on. Empirical
evidence relevant to these issues (from simulator or field studies) is still meager. The present
studies form part of a program of investigations of these and other human-performance issues
related to cockpit automation (Parasuraman et al., 1991). Here we briefly report the results of two
experiments investigating the effects of shifts in the type and level of automation on operator
performance in a multi-task environment. The first study examined the benefits and costs of
adaptive-automation shifts on operator performance. The second study examined the effects of
variations in automation reliability on operator detection of automation failures.

COST/BENEFIT ANALYSIS OF SHORT-CYCLE ADAPTIVE AUTOMATION

Adaptive automation involves transitions between levels of automation. The frequency of
these transitions may vary between two extremes. In long-cycle adaptive automation, a particular
flight function may be automated for long periods of time, then be carried out manually for a short
period, and then revert for another long period to automated control. At the other extreme, in
short-cycle adaptive automation, flight functions are cycled between manual and automated
control more frequently, particularly if the adaptive logic is sensitive to small changes in task
demands or pilot workload. The present study examined whether such short-cycle automation
shifts benefit operator performance on flight-related tasks and whether there are any costs
associated with the return to manual control following adaptive automation.

Methods


¹ Supported by contracts from NADC, Warminster, PA (John Deaton, technical monitor) and NASA Langley
Research Center, Hampton, VA (Alan Pope, technical monitor). Address correspondence to Raja Parasuraman,
Cognitive Science Laboratory, Catholic University of America, Washington DC 20064.

² Also referred to as "adaptive aiding" and "adaptive function allocation."
Twenty-four young adults were tested on a multiple-task set consisting of compensatory
tracking, system monitoring, and fuel management tasks presented simultaneously in separate
windows of a 14-in color monitor (Comstock & Arnegard, 1990). The forcing function for the
tracking task was derived from a sum of sine waves whose highest frequency was .OM Hz.
Subjects had to keep a circular target symbol in the center of the window using a two-axis joystick
under rate control. The performance measure was the integrated root mean square (RMS) error
averaged over a 10-min block. The system monitoring task required subjects to detect
"out-of-range" deviations of any one of four meters with continuously moving pointers (representing
temperature and pressure). Accuracy and reaction time (RT) were the performance measures. Fuel
management was a complex task requiring subjects to maintain the level of fuel in the two main
tanks of the aircraft at a specified critical level by transferring fuel from several supply tanks. Fuel
transfer was accomplished by activating one or more of several pumps. Pump flow rates varied
and occasionally a pump was deactivated for 60 sec. Several performance measures were used for
this task. Here we report the mean RMS error in fuel level deviations of the main tanks. (See
Parasuraman et al., 1991, for additional details on each task and performance measure.)
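
The two quantitative ingredients of the tracking task, a sum-of-sines forcing function and an RMS error score, can be illustrated with a minimal Python sketch. This is not the software used in the study: the function names, the unit amplitudes, the example frequencies, and the choice to score error as radial distance from the window center are all assumptions made for illustration.

```python
import math


def forcing_function(t, freqs_hz, amps=None):
    """Sum-of-sines disturbance at time t (seconds).

    freqs_hz and amps are placeholder values chosen by the caller;
    the study's actual component frequencies are not reproduced here.
    """
    if amps is None:
        amps = [1.0] * len(freqs_hz)
    return sum(a * math.sin(2 * math.pi * f * t) for a, f in zip(amps, freqs_hz))


def rms_error(x_dev, y_dev):
    """RMS distance of the target from the window center.

    x_dev and y_dev are equal-length sequences of sampled deviations
    on the two joystick axes (assumed scoring rule: radial error).
    """
    if not x_dev or len(x_dev) != len(y_dev):
        raise ValueError("need two non-empty sequences of equal length")
    squared = [x * x + y * y for x, y in zip(x_dev, y_dev)]
    return math.sqrt(sum(squared) / len(squared))


# Example: a target sitting 3 units right and 4 units up on every sample
# has a constant radial error of 5 units.
print(rms_error([3.0, 3.0], [4.0, 4.0]))
```

Averaging such a score over all samples in a 10-min block gives one value per block, which is the granularity at which the tracking results are reported.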

Each task could be performed under manual or automation control. Under automation control
all control inputs were disabled and the tasks were carried out by software with slight perturbations
so that the display windows were not completely static. To simulate the workload associated with
"supervisory control" of automation, occasional "deviations" in the automation occurred,
consisting of slow target drifts for tracking, delayed "reaction times" for system monitoring, and
delayed pump deactivations for the fuel management task. The degree to which subjects
"supervised" the automation could then be roughly estimated by asking them to note any such
deviations and report them after completing a block. To discourage subjects from active task
processing aimed at detecting such deviations, subjects were not queried after every block.

In an initial practice session each subject performed all three tasks manually. Subsequently,
each subject performed the three tasks under both manual and automation control in consecutive
10-min blocks in the sequence: manual control (M), automated control (A), return to manual
control (RM). The M-A-RM sequence was repeated four times with intervening rest periods.
During the automation phase one task was automated and the other two were under manual control;
this simulates model-driven adaptive logic (vs. performance-driven logic; see Parasuraman et al.,
1990). Each of the three tasks was automated individually for three separate groups of six
subjects. A fourth group of six subjects was a no-automation control. During the automation
phase of the other groups these subjects performed an unrelated single task (choice RT).
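
The block structure described above can be written out as a small Python sketch. It is an illustrative reconstruction only: the task labels, the `block_schedule` name, and the representation of the control group's choice-RT task are assumptions, not part of the original study software.

```python
TASKS = ("tracking", "monitoring", "fuel_management")
PHASES = ("M", "A", "RM")  # manual, automated, return to manual


def block_schedule(automated_task, cycles=4):
    """List the 10-min blocks for one group as (cycle, phase, modes).

    automated_task is the one task automated during phase A, or None
    for the no-automation control group, which performs an unrelated
    choice-RT task during that phase instead.
    """
    schedule = []
    for cycle in range(1, cycles + 1):
        for phase in PHASES:
            if phase == "A" and automated_task is None:
                modes = {"choice_rt": "manual"}
            else:
                modes = {
                    task: "automated"
                    if phase == "A" and task == automated_task
                    else "manual"
                    for task in TASKS
                }
            schedule.append((cycle, phase, modes))
    return schedule


# Example: the group whose tracking task is automated in each A block.
for cycle, phase, modes in block_schedule("tracking")[:3]:
    print(cycle, phase, modes)
```

Each group therefore completes 12 blocks (four repetitions of three phases), with only the A block differing between groups.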

Results and Discussion

There were four main sets of results. First, automation supervision, as assessed by the
number of "deviations" reported post-block, was adequate for tracking and monitoring but less so
for the fuel management task. Subjects reported noting 50%, 40%, and 25% of the deviations,
respectively, for these tasks.

Second, automation benefits (M vs. A) and costs (M vs. RM) were assessed in separate
4 x 2 x 4 ANOVAs (automated task x conditions [M-A or M-RM] x blocks [1-4]). Adaptive-automation
benefits were obtained. Tracking performance was facilitated by automation of the
monitoring and fuel management tasks, F (1,10) = 9.15, p < .02, as was monitoring performance
by automation of tracking and fuel management, F (1,10) = 4.60, p < .06, for accuracy, and F (1,10)
= 6.03, p < .05, for RT. However, while fuel management was better under automation control
of tracking and monitoring, the effect was not significant due to very high inter-subject variability
in the RMS error measure, F (1,10) = 2.34, p > .05.
forms when a person first encounters a device in a particular context and which is then reinforced
when it is re-encountered in the same way. Langer identified a number of antecedent conditions
that produce this type of attitude, including routine, repetition, and extremes of workload, all
conditions that can occur in the automated cockpit. We therefore reasoned that automation that was
unchanging (but less than perfect) would be more likely to induce a condition of complacency than
automation that varied, i.e., that complacency would be a function of the consistency of reliability
of the automated task in a multi-task environment. We hypothesized that complacency would be
potentially high for a group of subjects encountering automation with constant, unchanging
reliability, because this group was more likely to develop a premature cognitive commitment
regarding the nature of the automation and its efficiency. On the other hand, subjects encountering
inconsistent automation reliability would have a more open attitude concerning the efficiency of
the automation and hence would be less likely to be complacent. We therefore predicted that the
accuracy of operator detection of automation failure would be greater for variable-reliability than
for constant-reliability automation of a system monitoring task. We anticipated that this effect
would emerge with time spent under automation control, so that the performance in the two
conditions would diverge gradually from an initially equal starting point.

Methods 

Twenty-four adults aged 19-43 years were tested on the same multi-task flight simulation
used in the first study. The tracking and fuel management tasks were always performed manually
and the system monitoring task always under partial automation control. Normally the pointers of
each of the four meters fluctuated around the center of the scale within one limit in each direction
from center. A green "OK" light above the meters remained on. Independently and at intervals
according to a script, each pointer shifted its "center" position away from the middle of the meter.
These "system malfunctions" were automatically detected and reset by the automation except when
the automation failed, in which case the subject was required to respond. When the automation
detected a malfunction, a red warning light above the meters came on and the computer corrected
the fault in 4 sec. During this time the response keys for the monitoring task were disabled. When
the automation failed, the green OK light remained on and the subject was responsible for
detecting and correcting the malfunction by pressing one of the four function keys for each
meter. Feedback was provided on correct responses by the pointer of the relevant meter moving
immediately back to the center point and remaining there without fluctuating for a period of 1.5
sec. If a malfunction was not identified (whether by the automation or by the subject), the fault
was corrected 10 sec from the beginning of its occurrence. Sixteen malfunctions were
programmed to occur at unpredictable intervals within each 10-min block. Malfunctions appeared
in a quasi-random sequence with the restriction that an equal number of malfunctions occurred for
each meter (four) and for each direction of center shift (eight).
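
One way to generate a malfunction script that satisfies the stated balancing restrictions is sketched below in Python. The original scripting procedure is not described in detail, so this is only an assumed approach: the `make_script` name, the seeded shuffle, the two-up/two-down split within each meter, and the slot-based spacing of onsets (which keeps successive 10-sec fault windows from overlapping) are all illustrative choices.

```python
import random
from collections import Counter


def make_script(seed=0, block_s=600, n_meters=4, per_meter=4, max_fault_s=10):
    """Return a list of (onset_s, meter, direction) for one block.

    Balancing: each meter gets per_meter malfunctions, half "up" and
    half "down", so each direction occurs equally often overall.
    """
    rng = random.Random(seed)
    events = [
        (meter, direction)
        for meter in range(1, n_meters + 1)
        for direction in ("up", "down")
        for _ in range(per_meter // 2)
    ]
    rng.shuffle(events)  # quasi-random order of meters and directions

    slot = block_s / len(events)  # one time slot per malfunction
    script = []
    for i, (meter, direction) in enumerate(events):
        # Jitter the onset inside the slot, leaving room for the fault
        # to run its full duration before the next slot begins.
        onset = i * slot + rng.uniform(0, slot - max_fault_s)
        script.append((round(onset, 1), meter, direction))
    return script


script = make_script(seed=1)
print(len(script), Counter(direction for _, _, direction in script))
```

With the default arguments this yields 16 malfunctions per 600-sec block, four per meter and eight per direction, matching the counts given above.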

Automation reliability (the percentage of malfunctions correctly detected by the automation in
each 10-min block) was varied as a between-subjects factor. For the constant-reliability condition,
reliability was constant across blocks, whereas for the variable-reliability condition, reliability
varied from block to block. In the constant-reliability condition, automation reliability was "high"
for half the subjects (87.5%; 14 out of 16 malfunctions detected by the computer) and "low" for
the other half (56.25%; 9 out of 16 malfunctions detected). Thus, for each 10-min block, the
subject had to intervene for 2 malfunctions in the high-reliability condition and 7 malfunctions in
the low-reliability condition. These levels remained constant across each 10-min block. In the
variable-reliability condition, reliability alternated every 10 min from low (56.25%) to high
(87.5%) for half the subjects and from high to low for the other half of the subjects.
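
The reliability arithmetic and the two between-subjects schedules can be checked with a short Python sketch. The function and constant names are hypothetical, and the number of blocks is left as a parameter rather than taken from the study.

```python
N_MALFUNCTIONS = 16                # malfunctions per 10-min block
DETECTED = {"high": 14, "low": 9}  # malfunctions caught by the automation


def reliability(level):
    """Automation reliability as a percentage for one block."""
    return 100 * DETECTED[level] / N_MALFUNCTIONS


def interventions(level):
    """Malfunctions the subject must catch when the automation fails."""
    return N_MALFUNCTIONS - DETECTED[level]


def reliability_schedule(condition, start, n_blocks):
    """Per-block reliability levels for one subject.

    condition is "constant" or "variable"; start is the level in the
    first block ("high" or "low").
    """
    other = "low" if start == "high" else "high"
    if condition == "constant":
        return [start] * n_blocks
    return [start if i % 2 == 0 else other for i in range(n_blocks)]


print(reliability("high"), interventions("high"))  # 87.5 2
print(reliability("low"), interventions("low"))    # 56.25 7
print(reliability_schedule("variable", "low", 4))
```

The sketch makes explicit that the two constant subgroups differ only in level, while the two variable subgroups differ only in which level comes first.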

Every subject completed four sessions, each consisting of three continuous 10-min blocks.
In all conditions the automation routine was shut off for the last 20 min of session four (blocks 11
and 12) in order to simulate catastrophic automation failure. Subjects were trained and given 10
Figure 1. Effects of short-cycle automation of tracking on monitoring accuracy and RT.

Third, practice generally reduced automation benefits. As an example, see Fig. 1 for the
monitoring task. Tracking automation increased monitoring accuracy but the benefit dissipated
with blocks. While a performance ceiling could have limited the benefit in accuracy, monitoring
RT had not reached a floor in the later blocks in which no automation benefit was obtained.

Fourth, no evidence of sustained automation cost was obtained. Mean performance levels
on the M and RM blocks did differ for some conditions, but not significantly, and were not
significantly different from the corresponding M and RM blocks for the no-automation group.

These results confirm that short-cycle adaptive automation can have beneficial effects on
performance, as found previously with conventional, nonadaptive automation (e.g., Tsang &
Johnson, 1989). No evidence was obtained for impaired efficiency associated with the return to
manual control following automation. Training and practice reduced the automation benefit, but
extended performance did not produce any automation costs. Although preliminary, these findings
are encouraging with respect to the positive effects of adaptive automation. It remains to be seen,
however, whether these benefits persist with long-cycle adaptive automation and whether there are
any costs associated with long-term automation transitions.

OPERATOR  MONITORING  OF  AUTOMATION  FAILURE 

High levels of automation increase the demand on the operator to monitor for possible
automation failure (Chambers & Nagel, 1985; Parasuraman, 1987). Monitoring requirements may
also increase if adaptive systems come into use because many automatic transitions occur in such
systems. For these and other reasons, the effects of different characteristics of automation on the
ability of the operator to monitor automated systems need to be evaluated.

Pilot "complacency" is often mentioned as representing one potential effect of automation
relevant to monitoring performance (Thackray & Touchstone, 1989; Wiener, 1981). Although the
term remains poorly defined (but see Singh et al., 1991), complacency is one of several behavioral
coding categories used to classify aircraft incidents in NASA's Aviation Safety Reporting System
(ASRS). The ASRS Coding Manual defines "complacency" as "self-satisfaction which may result
in non-vigilance based on an unjustified assumption of satisfactory system state" (Billings et al.,
1976).

We examined the role of "automation-induced complacency" by investigating the effects of
automation reliability on operator monitoring of system failures during multi-task performance.
Langer (1989) proposed the concept of premature cognitive commitment to refer to a mindset that
Figure 2. Effects of automation reliability on operator (left) and system (right) monitoring performance.

min of practice on all three tasks performed under manual control. They were asked to give
equal attention to all three tasks during training and practice. At the end of the training session,
performance feedback on all three tasks was given. The experimental phase began with subjects
being informed that the monitoring task would be automated, so that they should focus their
attention on the tracking and fuel management tasks. However, they were informed that the
automation routine was not completely reliable and that they were required to supervise the
automation and detect and respond to any malfunction that the automation failed to detect.

Results  and  Discussion 

We predicted that initially the accuracy of manual intervention in the automated monitoring
task should be roughly the same for the two experimental groups, but that later the performance of
the variable-reliability group would be superior to that of the constant-reliability group. The results
supported these predictions. The efficiency of manual intervention in the automated monitoring
task was significantly higher in the variable-reliability condition than in the constant-reliability
condition (see Fig. 2, left). The performance advantage for the variable-reliability group was
present early and grew in size across the 12 10-min blocks. The effect of automation reliability
(constant vs. variable) was not significant in Block 1, F (1,40) = 2.67, p > .05, but was significant
for all subsequent 11 blocks, F (1,40) > 8.99, p < .01. Detection accuracy for the
variable-reliability group reached a high level quickly, exceeding 80% after block 4, but remained at a low
level for the constant-reliability group. Following the total automation failure in block 11,
performance of the constant-reliability group improved, but still did not recover to the level of the
variable-reliability group. Combined system (i.e., human + computer) monitoring performance (cf.
Thackray & Touchstone, 1989) was also superior with variable-reliability automation, particularly
following total automation failure (Fig. 2, right). This suggests that operator complacency could
limit system performance following total automation failure even if automation reliability was
much higher (e.g., between 99% and 100%) than the "high" value of 87.5% used in this study.

The performance advantage in monitoring accuracy was not a function of speed-accuracy
tradeoff because RT was unaffected by automation reliability. Initial performance level was also
not a factor because both groups had similar, high detection rates under manual control. Moreover,
there were no significant differences between the two subgroups of the constant-reliability
condition, indicating, somewhat surprisingly, that the absolute level of automation reliability,
within the range studied here, did not affect the pattern of results. This shows that the increased
performance of the variable-reliability group was not a function of a higher "signal rate" afforded
by less reliable automation. Finally, there were no significant differences in tracking or
fuel-management performance between automation reliability conditions, showing that the performance
advantage of the variable-reliability group was not due to their allocating processing resources
away from tracking or fuel management to system monitoring.

These results provide what we believe is the first empirical evidence of the performance
consequences of "complacency," unconfounded by other factors such as speed-accuracy tradeoff,
initial performance capabilities, signal rate effects, differential allocation of processing resources
to other flight tasks, or experimenter effects (all subjects received the same instructions).
Performance consequences of complacency in system monitoring were demonstrated to be related
to characteristics of the monitoring task automation, namely automation reliability and consistency.
In our ongoing research we are examining how other features of automation, including adaptive
automation, influence operator efficiency in monitoring for automation failure.